Unsupervised Measurement of Translation Quality Using Multi-engine, Bi-directional Translation
Authors
Abstract
Lay people discussing machine translation systems often perform a round trip translation, that is, translating a text into a foreign language and back, to measure the quality of the system. The idea behind this is that a good system will produce a round trip translation that is exactly (or perhaps very close to) the original text. However, people working with machine translation systems intuitively know that round trip translation is not a good evaluation method. In this article we will show empirically that round trip translation cannot be used as a measure of the quality of a machine translation system. Even when taking the translations of multiple machine translation systems into account, to reduce the impact of errors of a single system, round trip translation cannot be used to measure machine translation quality.
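The round trip procedure described in the abstract can be illustrated with a minimal sketch. This is not the article's experimental setup: the toy lexicons, the `lexicon_engine` and `identity_engine` helpers, and the use of `difflib.SequenceMatcher` as the similarity measure are all assumptions made for illustration. The sketch shows one well-known reason the measure can mislead: an "engine" that translates nothing still achieves a perfect round trip.

```python
from difflib import SequenceMatcher

# Hypothetical toy lexicons standing in for real MT engines (illustrative only).
EN_TO_FR = {"the": "le", "cat": "chat", "sleeps": "dort"}
FR_TO_EN = {fr: en for en, fr in EN_TO_FR.items()}


def lexicon_engine(lexicon):
    """Build a word-for-word 'engine'; unknown words pass through unchanged."""
    def translate(text):
        return " ".join(lexicon.get(token, token) for token in text.split())
    return translate


def identity_engine(text):
    """A useless 'engine' that returns its input untranslated."""
    return text


def round_trip_score(source, forward, backward):
    """Token-level similarity between the source and its round trip translation."""
    round_trip = backward(forward(source))
    return SequenceMatcher(None, source.split(), round_trip.split()).ratio()


source = "the cat sleeps"
good = round_trip_score(source, lexicon_engine(EN_TO_FR), lexicon_engine(FR_TO_EN))
useless = round_trip_score(source, identity_engine, identity_engine)
print(good, useless)
```

Both scores come out as 1.0 in this toy setting, so the round trip score alone cannot distinguish the engine that actually translates from the one that does nothing.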
Similar resources
Translation Lexicon Estimates from Non-Parallel Corpora Pairs
The estimation of translation lexicon probabilities from parallel corpora is well studied in statistical machine translation. Whenever parallel corpora are not available, it is still possible to obtain unsupervised estimates from pairs of monolingual, non-parallel corpora. In both cases the standard estimator is the Expectation-Maximization (EM) that aims at increasing the likelihood of the sou...
Unsupervised Extraction of False Friends from Parallel Bi-Texts Using the Web as a Corpus
False friends are pairs of words in two languages that are perceived as similar, but have different meanings, e.g., Gift in German means poison in English. In this paper, we present several unsupervised algorithms for acquiring such pairs from a sentence-aligned bi-text. First, we try different ways of exploiting simple statistics about monolingual word occurrences and cross-lingual word co-occ...
Speech-to-Speech Translation Software on PDAs for Travel Conversation
We present an automatic speech-to-speech translation system for Personal Digital Assistants (PDAs) that helps oral communication between Japanese and English speakers in various situations while traveling. Our own compact large-vocabulary continuous speech recognition engine and compact translation engine based on a lexicalized grammar provided the basis for the Japanese/English bi-directional ...
A Recurrent Neural Networks Approach for Estimating the Quality of Machine Translation Output
This paper presents a novel approach using recurrent neural networks for estimating the quality of machine translation output. A sequence of vectors made by the prediction method is used as the input of the final recurrent neural network. The prediction method uses bi-directional recurrent neural network architecture both on source and target sentence to fully utilize the bi-directional quality...
Initial Results on the F-logic to OWL Bi-directional Translation on a Tabled Prolog Engine
In this paper, we show our results on the bi-directional data exchange between the F-logic language supported by the Flora2 system and the OWL language. Most of the TBox and ABox axioms are translated preserving the semantics between the two representations, such as: proper inclusion, individual definition, functional properties, while some axioms and restrictions require a change in the semant...